Market Intelligence Data Collection from Heterogeneous Sources with Similarity-Based Selection Clustering Technique Using Knowledge Maps: A Heuristic Approach
نویسندگان
چکیده
Business Intelligence (BI) has emerged as one of software solutions that have maximum allocated investments by many organizations for the year 2005. Among various forms and application-based business intelligence, market intelligence (MI) is viewed as a crucial factor for a company to succeed both operationally and strategically in today’s’ competitive environment. Capturing market intelligence data has apparently become easy, especially with the proliferation of the Web. But, this has made data collection more difficult in reality from the system’s point of view, as data sources on the web are voluminous, heterogeneous in terms of structures and semantics, and some part of it may be irrelevant to a specific organizations’ marketing decision-making context, which is the primary premises of market intelligence systems. To address these three specific problems, an algorithm based on similarity measures and multi-dimensional scaling (MDS), which produces hierarchical clusters of knowledge maps from a training data-source set for collecting inputs from heterogeneous sources for capturing market intelligence, is proposed in this paper. The paper illustrates that this algorithm can reduce irrelevant or highly similar data sources for inclusion in the selected data-source repository – represented in the form of clusters of knowledge maps. Therefore, it acts as a similarity-based selection and filtering tool also, with the specific purpose of data collection for MI. Incorporating more advanced techniques for Knowledge maps creation e.g. the Genetic Algorithm-based approaches can further expand this work.
منابع مشابه
A Multi-Objective Approach to Fuzzy Clustering using ITLBO Algorithm
Data clustering is one of the most important areas of research in data mining and knowledge discovery. Recent research in this area has shown that the best clustering results can be achieved using multi-objective methods. In other words, assuming more than one criterion as objective functions for clustering data can measurably increase the quality of clustering. In this study, a model with two ...
متن کاملStock Price Prediction using Machine Learning and Swarm Intelligence
Background and Objectives: Stock price prediction has become one of the interesting and also challenging topics for researchers in the past few years. Due to the non-linear nature of the time-series data of the stock prices, mathematical modeling approaches usually fail to yield acceptable results. Therefore, machine learning methods can be a promising solution to this problem. Methods: In this...
متن کاملFeature Selection for Small Sample Sets with High Dimensional Data Using Heuristic Hybrid Approach
Feature selection can significantly be decisive when analyzing high dimensional data, especially with a small number of samples. Feature extraction methods do not have decent performance in these conditions. With small sample sets and high dimensional data, exploring a large search space and learning from insufficient samples becomes extremely hard. As a result, neural networks and clustering a...
متن کاملSteel Consumption Forecasting Using Nonlinear Pattern Recognition Model Based on Self-Organizing Maps
Steel consumption is a critical factor affecting pricing decisions and a key element to achieve sustainable industrial development. Forecasting future trends of steel consumption based on analysis of nonlinear patterns using artificial intelligence (AI) techniques is the main purpose of this paper. Because there are several features affecting target variable which make the analysis of relations...
متن کاملA Framework for Optimal Attribute Evaluation and Selection in Hesitant Fuzzy Environment Based on Enhanced Ordered Weighted Entropy Approach for Medical Dataset
Background: In this paper, a generic hesitant fuzzy set (HFS) model for clustering various ECG beats according to weights of attributes is proposed. A comprehensive review of the electrocardiogram signal classification and segmentation methodologies indicates that algorithms which are able to effectively handle the nonstationary and uncertainty of the signals should be used for ECG analysis. Ex...
متن کامل